From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: Stefan Monnier Newsgroups: gmane.emacs.devel,gmane.lisp.gcl.devel Subject: Re: [Gcl-devel] utf8 and emacs text/string multibyte representation Date: Fri, 31 Oct 2014 15:52:53 -0400 Message-ID: References: <87wq7jxc7d.fsf@gnu.org> <87zjcfx985.fsf_-_@maguirefamily.org> <877fzhwsls.fsf@maguirefamily.org> NNTP-Posting-Host: plane.gmane.org Mime-Version: 1.0 Content-Type: text/plain X-Trace: ger.gmane.org 1414789079 19867 80.91.229.3 (31 Oct 2014 20:57:59 GMT) X-Complaints-To: usenet@ger.gmane.org NNTP-Posting-Date: Fri, 31 Oct 2014 20:57:59 +0000 (UTC) Cc: Raymond Toy , gcl-devel@gnu.org, emacs-devel@gnu.org To: Camm Maguire Original-X-From: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Fri Oct 31 21:57:52 2014 Return-path: Envelope-to: ged-emacs-devel@m.gmane.org Original-Received: from lists.gnu.org ([208.118.235.17]) by plane.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1XkJGS-0008B6-Ka for ged-emacs-devel@m.gmane.org; Fri, 31 Oct 2014 21:57:44 +0100 Original-Received: from localhost ([::1]:43771 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1XkJGS-00005V-9f for ged-emacs-devel@m.gmane.org; Fri, 31 Oct 2014 16:57:44 -0400 Original-Received: from eggs.gnu.org ([2001:4830:134:3::10]:40400) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1XkIFq-0007UA-TK for emacs-devel@gnu.org; Fri, 31 Oct 2014 15:53:10 -0400 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1XkIFi-0005mP-EI for emacs-devel@gnu.org; Fri, 31 Oct 2014 15:53:02 -0400 Original-Received: from ironport2-out.teksavvy.com ([206.248.154.181]:16293) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1XkIFi-0005mG-BZ; Fri, 31 Oct 2014 15:52:54 -0400 X-IronPort-Anti-Spam-Filtered: true X-IronPort-Anti-Spam-Result: Au4MAOatTlRFpY87/2dsb2JhbABcgw6DYoZ+y1MEAgKBHBcBAXyEAwEBAwFWIwULCzQSFBgNJIhLCctyAQEBAQYCAR+RCAeESwWyIIFvhBQhgnoBAQE X-IPAS-Result: Au4MAOatTlRFpY87/2dsb2JhbABcgw6DYoZ+y1MEAgKBHBcBAXyEAwEBAwFWIwULCzQSFBgNJIhLCctyAQEBAQYCAR+RCAeESwWyIIFvhBQhgnoBAQE X-IronPort-AV: E=Sophos;i="5.04,797,1406606400"; d="scan'208";a="95695452" Original-Received: from 69-165-143-59.dsl.teksavvy.com (HELO pastel.home) ([69.165.143.59]) by ironport2-out.teksavvy.com with ESMTP/TLS/DHE-RSA-AES256-SHA; 31 Oct 2014 15:52:53 -0400 Original-Received: by pastel.home (Postfix, from userid 20848) id 39E0462BA; Fri, 31 Oct 2014 15:52:53 -0400 (EDT) In-Reply-To: <877fzhwsls.fsf@maguirefamily.org> (Camm Maguire's message of "Thu, 30 Oct 2014 10:16:15 -0400") User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/25.0.50 (gnu/linux) X-detected-operating-system: by eggs.gnu.org: Genre and OS details not recognized. X-Received-From: 206.248.154.181 X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.1.14 Precedence: list List-Id: "Emacs development discussions." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Original-Sender: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.devel:176170 gmane.lisp.gcl.devel:8803 Archived-At: > Do these other lisps allocate a fresh character on each aref? Do they > maintain some ~2^21 sized table in core? (And isn't emacs a "lisp" > :-)). I'd expect that dynamically typed languages that have a character type distinct from integers all use an "unboxed" representation for those chars (i.e. reserve some tag-bit combination for the "character" type). IIRC, a unicode char only needs 22bit, so that leaves a lot of space for tagbits. Stefan