From mboxrd@z Thu Jan 1 00:00:00 1970 Path: main.gmane.org!not-for-mail From: Miles Bader Newsgroups: gmane.emacs.devel Subject: Re: Bug 130397 Date: Sat, 8 Jan 2005 22:29:21 +0900 Message-ID: References: <28878.1105029010@ichips.intel.com> Reply-To: snogglethorpe@gmail.com, miles@gnu.org NNTP-Posting-Host: deer.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: 7bit X-Trace: sea.gmane.org 1105298812 8095 80.91.229.6 (9 Jan 2005 19:26:52 GMT) X-Complaints-To: usenet@sea.gmane.org NNTP-Posting-Date: Sun, 9 Jan 2005 19:26:52 +0000 (UTC) Cc: Geoff Kuenning , 130397@bugs.debian.org, agustin.martin@hispalinux.es, lionel@mamane.lu, Kenichi Handa , emacs-devel@gnu.org, juri@jurta.org, Ken Stevens , Stefan Monnier Original-X-From: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Sun Jan 09 20:26:41 2005 Return-path: Original-Received: from lists.gnu.org ([199.232.76.165]) by deer.gmane.org with esmtp (Exim 3.35 #1 (Debian)) id 1Cnii1-0006yI-00 for ; Sun, 09 Jan 2005 20:26:41 +0100 Original-Received: from localhost ([127.0.0.1] helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1CnitT-0005mx-T9 for ged-emacs-devel@m.gmane.org; Sun, 09 Jan 2005 14:38:31 -0500 Original-Received: from mailman by lists.gnu.org with tmda-scanned (Exim 4.43) id 1CnH39-0008G2-VG for emacs-devel@gnu.org; Sat, 08 Jan 2005 08:54:44 -0500 Original-Received: from exim by lists.gnu.org with spam-scanned (Exim 4.43) id 1CnH2x-00089b-49 for emacs-devel@gnu.org; Sat, 08 Jan 2005 08:54:27 -0500 Original-Received: from [199.232.76.173] (helo=monty-python.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1CnH2w-000836-KJ for emacs-devel@gnu.org; Sat, 08 Jan 2005 08:54:26 -0500 Original-Received: from [64.233.184.204] (helo=wproxy.gmail.com) by monty-python.gnu.org with esmtp (Exim 4.34) id 1CnGeg-0000Bm-4a for emacs-devel@gnu.org; Sat, 08 Jan 2005 08:29:22 -0500 Original-Received: by wproxy.gmail.com with SMTP id 49so873804wri for ; Sat, 08 Jan 2005 05:29:21 -0800 (PST) DomainKey-Signature: a=rsa-sha1; q=dns; c=nofws; s=beta; d=gmail.com; h=received:message-id:date:from:reply-to:to:subject:cc:in-reply-to:mime-version:content-type:content-transfer-encoding:references; b=VWMrGju+ySzNRdWGvcje9w/dks88jTzsQr+dbKcnKyxBDjHU7Ho66f27k7H0JCs9YA4rHpYQkp8mHtbRFTqSkMSvwN2ckNL2wnz437ImMOb1hJoL4v4orpbXdWds/YYMqvnxmMUTP2IEJ+w1pOpnX0fQDV+yYccaYV7UMGhTMtU= Original-Received: by 10.54.59.36 with SMTP id h36mr276201wra; Sat, 08 Jan 2005 05:29:21 -0800 (PST) Original-Received: by 10.54.19.32 with HTTP; Sat, 8 Jan 2005 05:29:21 -0800 (PST) Original-To: David Kastrup In-Reply-To: X-Mailman-Approved-At: Sun, 09 Jan 2005 14:38:21 -0500 X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: "Emacs development discussions." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Original-Sender: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Errors-To: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Xref: main.gmane.org gmane.emacs.devel:32075 X-Report-Spam: http://spam.gmane.org/gmane.emacs.devel:32075 > You are wrongly assuming that the buffer is maintained in UTF-8. It > isn't. Byte indexing is not going to be fun with regard to > efficiency, unless we get some interface that will, while writing out > a file in UTF-8, store an array of byte/character correspondences for > the UTF-8 (or whatever other) character conversion somewhere. Er, does efficiency matter all that much when parsing output from ispell -a? After all, it's feeding the input line-by-line to ispell (judging from man page), and you only have to actually deal with offsets for lines with mispellings -- which are the minority, and result in user interaction anyway, which will tend to hide any sort of slight inefficiency. If ispell wants utf-8, it's easy enough to convert each input line to utf-8 and deal with offsets into that in the event of a mispelling; even if emacs has to process the line character by character to do it, it seems like it would be fast enough. For the great bulk of the buffer without any mispellings, you won't incur the inefficiency, which is what really matters. -Miles