From mboxrd@z Thu Jan 1 00:00:00 1970 From: Simon Guest <simon.guest@tesujimath.org> Subject: Implemented word count for subtrees Date: Sat, 23 Apr 2011 20:57:14 +1200 Message-ID: <867hal4a6t.wl%simon.guest@tesujimath.org> Mime-Version: 1.0 (generated by SEMI 1.14.6 - "Maruoka") Content-Type: text/plain; charset=US-ASCII Return-path: <emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org> Received: from eggs.gnu.org ([140.186.70.92]:35807) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from <simon.guest@tesujimath.org>) id 1QDYeY-0001mJ-UC for emacs-orgmode@gnu.org; Sat, 23 Apr 2011 04:57:23 -0400 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from <simon.guest@tesujimath.org>) id 1QDYeX-0006Kk-J6 for emacs-orgmode@gnu.org; Sat, 23 Apr 2011 04:57:22 -0400 Received: from unit0.ironport.snap.net.nz ([202.37.100.104]:45142) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from <simon.guest@tesujimath.org>) id 1QDYeW-0006KX-OB for emacs-orgmode@gnu.org; Sat, 23 Apr 2011 04:57:21 -0400 Received: from aji.tesujimath.org (233.253.69.111.dynamic.snap.net.nz [111.69.253.233]) by rupert.snap.net.nz (Postfix) with ESMTPS id 6E977201AD for <emacs-orgmode@gnu.org>; Sat, 23 Apr 2011 20:57:15 +1200 (NZST) Received: from sabaki.local ([10.0.1.242] helo=sabaki.tesujimath.org.tesujimath.org) by aji.tesujimath.org with esmtp (Exim 4.72) (envelope-from <simon.guest@tesujimath.org>) id 1QDYeR-0005Xv-7X for emacs-orgmode@gnu.org; Sat, 23 Apr 2011 20:57:15 +1200 List-Id: "General discussions about Org-mode." <emacs-orgmode.gnu.org> List-Unsubscribe: <https://lists.gnu.org/mailman/options/emacs-orgmode>, <mailto:emacs-orgmode-request@gnu.org?subject=unsubscribe> List-Archive: </archive/html/emacs-orgmode> List-Post: <mailto:emacs-orgmode@gnu.org> List-Help: <mailto:emacs-orgmode-request@gnu.org?subject=help> List-Subscribe: <https://lists.gnu.org/mailman/listinfo/emacs-orgmode>, <mailto:emacs-orgmode-request@gnu.org?subject=subscribe> Errors-To: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org Sender: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org To: emacs-org list <emacs-orgmode@gnu.org> Dear Org mode people, I implemented word counting for Org mode sub-trees. That is, count each sub-tree, and accumulate totals into the parent heading lines. Others have asked about this, so I attach my code below. I started with Paul Sexton's code posted to the list on 21/2/11. I had some different requirements, so I hacked this mercilessly. Sorry, Paul. I was most concerned about speed, so I removed any check that caused repeated hunting around in the org mode buffer - all the contextual stuff. I also skip heading lines, as I didn't want them in my total. (I'm using the wonderful Org mode to write a novel, and the heading lines are for my organisation only, not part of the text.) Anyway, here it is. I use this key-binding in my .emacs. (define-key org-mode-map "\C-c\C-xw" 'org-wc-display) By the way, it complains if you call it without mark being set. I want to use (interactive "r") to handle regions, but don't know how to handle this error case. Suggestions welcome. cheers, Simon ;; org-wc.el ;; ;; Count words in org mode trees. ;; Shows word count per heading line, summed over sub-headings. ;; Aims to be fast, so doesn't check carefully what it's counting. ;-) ;; ;; Simon Guest, 23/4/11 ;; ;; Implementation based on: ;; - Paul Sexton's word count posted on org-mode mailing list 21/2/11. ;; - clock overlays (defun org-in-heading-line () "Is point in a line starting with `*'?" (equal (char-after (point-at-bol)) ?*)) (defun org-word-count (beg end) "Report the number of words in the Org mode buffer or selected region." (interactive "r") (unless mark-active (setf beg (point-min) end (point-max))) (let ((wc (org-word-count-aux beg end))) (message (format "%d words in %s." wc (if mark-active "region" "buffer"))))) (defun org-word-count-aux (beg end) "Report the number of words in the selected region. Ignores: heading lines, blocks, comments, drawers. LaTeX macros are counted as 1 word." (let ((wc 0) (block-begin-re "^#\\\+BEGIN") (block-end-re "^#\\+END") (latex-macro-regexp "\\\\[A-Za-z]+\\(\\[[^]]*\\]\\|\\){\\([^}]*\\)}") (drawers-re (concat "^[ \t]*:\\(" (mapconcat 'regexp-quote org-drawers "\\|") "\\):[ \t]*$")) (drawers-end-re "^[ \t]*:END:")) (save-excursion (goto-char beg) (while (< (point) end) (cond ;; Ignore heading lines. ((org-in-heading-line) (forward-line)) ;; Ignore blocks. ((looking-at block-begin-re) (re-search-forward block-end-re)) ;; Ignore comments. ((org-in-commented-line) (forward-line)) ;; Ignore drawers. ((looking-at drawers-re) (re-search-forward drawers-end-re nil t)) ;; Count latex macros as 1 word, ignoring their arguments. ((save-excursion (backward-char) (looking-at latex-macro-regexp)) (goto-char (match-end 0)) (setf wc (+ 2 wc))) (t (progn (re-search-forward "\\w+\\W*") (incf wc)))))) wc)) (defun org-wc-count-subtrees () "Count words in each subtree, putting result as the property :org-wc on that heading." (interactive) (remove-text-properties (point-min) (point-max) '(:org-wc t)) (save-excursion (goto-char (point-max)) (while (outline-previous-heading) (org-narrow-to-subtree) (let ((wc (org-word-count-aux (point-min) (point-max)))) (put-text-property (point) (point-at-eol) :org-wc wc) (goto-char (point-min)) (widen))))) (defun org-wc-display (beg end total-only) "Show subtree word counts in the entire buffer. With prefix argument, only show the total wordcount for the buffer or region in the echo area. Use \\[org-wc-remove-overlays] to remove the subtree times. Ignores: heading lines, blocks, comments, drawers. LaTeX macros are counted as 1 word." (interactive "r\nP") (org-wc-remove-overlays) (unless total-only (let (wc p) (org-wc-count-subtrees) (save-excursion (goto-char (point-min)) (while (or (and (equal (setq p (point)) (point-min)) (get-text-property p :org-wc)) (setq p (next-single-property-change (point) :org-wc))) (goto-char p) (when (setq wc (get-text-property p :org-wc)) (org-wc-put-overlay wc (funcall outline-level)))) ;; Arrange to remove the overlays upon next change. (when org-remove-highlights-with-change (org-add-hook 'before-change-functions 'org-wc-remove-overlays nil 'local))))) (if mark-active (org-word-count beg end) (org-word-count (point-min) (point-max)))) (defvar org-wc-overlays nil) (make-variable-buffer-local 'org-wc-overlays) (defun org-wc-put-overlay (wc &optional level) "Put an overlays on the current line, displaying word count. If LEVEL is given, prefix word count with a corresponding number of stars. This creates a new overlay and stores it in `org-wc-overlays', so that it will be easy to remove." (let* ((c 60) (l (if level (org-get-valid-level level 0) 0)) (off 0) ov tx) (org-move-to-column c) (unless (eolp) (skip-chars-backward "^ \t")) (skip-chars-backward " \t") (setq ov (make-overlay (1- (point)) (point-at-eol)) tx (concat (buffer-substring (1- (point)) (point)) (make-string (+ off (max 0 (- c (current-column)))) ?.) (org-add-props (format "%s" (number-to-string wc)) (list 'face 'org-wc-overlay)) "")) (if (not (featurep 'xemacs)) (overlay-put ov 'display tx) (overlay-put ov 'invisible t) (overlay-put ov 'end-glyph (make-glyph tx))) (push ov org-wc-overlays))) (defun org-wc-remove-overlays (&optional beg end noremove) "Remove the occur highlights from the buffer. BEG and END are ignored. If NOREMOVE is nil, remove this function from the `before-change-functions' in the current buffer." (interactive) (unless org-inhibit-highlight-removal (mapc 'delete-overlay org-wc-overlays) (setq org-wc-overlays nil) (unless noremove (remove-hook 'before-change-functions 'org-wc-remove-overlays 'local)))) (provide 'org-wc)