From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: Chong Yidong Newsgroups: gmane.emacs.devel Subject: Re: size of emacs executable after unicode merge Date: Mon, 10 Nov 2008 10:18:20 -0500 Message-ID: <87d4h3ip8j.fsf@cyd.mit.edu> References: <200805140351.m4E3pQuE004549@sallyv1.ics.uci.edu> <200805141652.m4EGqikr018644@sallyv1.ics.uci.edu> <200805151529.m4FFTlF1004684@sallyv1.ics.uci.edu> <482D8435.6060407@gnu.org> <20081030101819.GA15223@orion.lan> <200810311507.m9VF7EAl022755@mothra.ics.uci.edu> <873ai7t7fx.fsf@cyd.mit.edu> <87iqqwk672.fsf@cyd.mit.edu> NNTP-Posting-Host: lo.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii X-Trace: ger.gmane.org 1226330345 27025 80.91.229.12 (10 Nov 2008 15:19:05 GMT) X-Complaints-To: usenet@ger.gmane.org NNTP-Posting-Date: Mon, 10 Nov 2008 15:19:05 +0000 (UTC) Cc: rms@gnu.org, emanuele.giaquinta@gmail.com, emacs-devel@gnu.org, dann@ics.uci.edu, monnier@iro.umontreal.ca, evilborisnet@netscape.net, jasonr@gnu.org To: Kenichi Handa Original-X-From: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Mon Nov 10 16:20:05 2008 Return-path: Envelope-to: ged-emacs-devel@m.gmane.org Original-Received: from lists.gnu.org ([199.232.76.165]) by lo.gmane.org with esmtp (Exim 4.50) id 1KzYYX-0000TI-FA for ged-emacs-devel@m.gmane.org; Mon, 10 Nov 2008 16:19:57 +0100 Original-Received: from localhost ([127.0.0.1]:37408 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1KzYXP-0008Vn-Hx for ged-emacs-devel@m.gmane.org; Mon, 10 Nov 2008 10:18:47 -0500 Original-Received: from mailman by lists.gnu.org with tmda-scanned (Exim 4.43) id 1KzYWr-0008TT-JD for emacs-devel@gnu.org; Mon, 10 Nov 2008 10:18:13 -0500 Original-Received: from exim by lists.gnu.org with spam-scanned (Exim 4.43) id 1KzYWq-0008TG-2T for emacs-devel@gnu.org; Mon, 10 Nov 2008 10:18:13 -0500 Original-Received: from [199.232.76.173] (port=40382 helo=monty-python.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1KzYWp-0008TC-Tx for emacs-devel@gnu.org; Mon, 10 Nov 2008 10:18:11 -0500 Original-Received: from cyd.mit.edu ([18.115.2.24]:41821) by monty-python.gnu.org with esmtp (Exim 4.60) (envelope-from ) id 1KzYWn-0004Wg-K9; Mon, 10 Nov 2008 10:18:09 -0500 Original-Received: by cyd.mit.edu (Postfix, from userid 1000) id 869A657E1BA; Mon, 10 Nov 2008 10:18:20 -0500 (EST) In-Reply-To: (Kenichi Handa's message of "Mon, 10 Nov 2008 10:59:27 +0900") User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/23.0.60 (gnu/linux) X-detected-operating-system: by monty-python.gnu.org: Genre and OS details not recognized. X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: "Emacs development discussions." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Original-Sender: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Errors-To: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.devel:105534 Archived-At: Kenichi Handa writes: > map-charset-chars calls FUNCTION (modify-category-entry in > the above case) on all characters in CHARSET. But, to know > which characters belongs to CHARET (chinese-gb2312 in the > above case), we must consult with > "etc/charsets/GB2312.map". The contents is something like > this: > > 0x2121-0x2123 0x3000 > 0x2124 0x30FB > 0x2125 0x02C9 > [...] > > From this file, we know that #x3000, #x3001, #x3002, #x30FB, > #x02C9, ... belong to chinese-gb2312. > > We must find a way to make map-charset-chars work without > loading that map into a char-table. > > One idea is to have a single boolean vector of size #x110000 > (139264 bytes), setup it for CHARSET everytime when we call > map-charset-chars for the different charset. In that > vector, only the bit for #x3000, #x3001, #x3002, etc are 1 > for chinese-gb2312. Then map-charset-chars can know for > which characters FUNCTION must be called. Good idea. But, is this sufficient? Do you know whether map-charset-chars the only thing during loadup that loads these char-tables?