From mboxrd@z Thu Jan 1 00:00:00 1970 From: ludo@gnu.org (Ludovic =?utf-8?Q?Court=C3=A8s?=) Subject: Re: [PATCH python-tests] gnu: python-2.7: Enable UCS-4 Unicode encoding. Date: Tue, 24 Jan 2017 22:08:10 +0100 Message-ID: <87d1fc6u9x.fsf@gnu.org> References: <20170122233159.2622-1-dannym@scratchpost.org> <87sho968o5.fsf@kirby.i-did-not-set--mail-host-address--so-tickle-me> <87r33tl7qb.fsf@gnu.org> <20170124004604.4ba3ad2c@scratchpost.org> Mime-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable Return-path: Received: from eggs.gnu.org ([2001:4830:134:3::10]:48197) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1cW8KA-0004iR-IY for guix-devel@gnu.org; Tue, 24 Jan 2017 16:08:21 -0500 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1cW8K5-00066J-Ld for guix-devel@gnu.org; Tue, 24 Jan 2017 16:08:18 -0500 In-Reply-To: <20170124004604.4ba3ad2c@scratchpost.org> (Danny Milosavljevic's message of "Tue, 24 Jan 2017 00:46:04 +0100") List-Id: "Development of GNU Guix and the GNU System distribution." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: guix-devel-bounces+gcggd-guix-devel=m.gmane.org@gnu.org Sender: "Guix-devel" To: Danny Milosavljevic Cc: guix-devel@gnu.org Hi Danny, Danny Milosavljevic skribis: >> > Otherwise LGTM. I checked some other distros and they seem to have >> > this enabled. Thanks! >> That means that strings are internally UCS-4-encoded, right? >> What=E2=80=99s the rationale, and what happens when this flag is omitted? > > The CPython C interface changes depending on the flag and some Python > extensions don't work with the narrow UTF-16 Unicode - which is what > it would use if you don't specify. > > The default, UTF-16, is basically just historical baggage from when > Unicode had fewer than 65536 codepoints in the standard. [...] Thanks for the explanation, it makes a lot of sense! Ludo=E2=80=99.