From mboxrd@z Thu Jan 1 00:00:00 1970 Path: main.gmane.org!not-for-mail From: Baurjan Ismagulov Newsgroups: gmane.emacs.devel Subject: Re: adding a new encoding Date: Sun, 20 Jun 2004 18:35:14 +0200 Sender: emacs-devel-bounces+emacs-devel=quimby.gnus.org@gnu.org Message-ID: <20040620163514.GA3576@ata.cs.hun.edu.tr> References: <20040619150301.GC3467@ata.cs.hun.edu.tr> NNTP-Posting-Host: deer.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii X-Trace: sea.gmane.org 1087749253 1734 80.91.224.253 (20 Jun 2004 16:34:13 GMT) X-Complaints-To: usenet@sea.gmane.org NNTP-Posting-Date: Sun, 20 Jun 2004 16:34:13 +0000 (UTC) Original-X-From: emacs-devel-bounces+emacs-devel=quimby.gnus.org@gnu.org Sun Jun 20 18:34:03 2004 Return-path: Original-Received: from quimby.gnus.org ([80.91.224.244]) by deer.gmane.org with esmtp (Exim 3.35 #1 (Debian)) id 1Bc5Gd-0007BK-00 for ; Sun, 20 Jun 2004 18:34:03 +0200 Original-Received: from lists.gnu.org ([199.232.76.165]) by quimby.gnus.org with esmtp (Exim 3.35 #1 (Debian)) id 1Bc5Gd-0000iH-00 for ; Sun, 20 Jun 2004 18:34:03 +0200 Original-Received: from localhost ([127.0.0.1] helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.33) id 1Bc5Hp-0001yT-Sc for emacs-devel@quimby.gnus.org; Sun, 20 Jun 2004 12:35:17 -0400 Original-Received: from mailman by lists.gnu.org with tmda-scanned (Exim 4.33) id 1Bc5Hg-0001x3-5q for emacs-devel@gnu.org; Sun, 20 Jun 2004 12:35:08 -0400 Original-Received: from exim by lists.gnu.org with spam-scanned (Exim 4.33) id 1Bc5He-0001wi-GP for emacs-devel@gnu.org; Sun, 20 Jun 2004 12:35:07 -0400 Original-Received: from [199.232.76.173] (helo=monty-python.gnu.org) by lists.gnu.org with esmtp (Exim 4.33) id 1Bc5He-0001wY-Ca for emacs-devel@gnu.org; Sun, 20 Jun 2004 12:35:06 -0400 Original-Received: from [82.83.206.115] (helo=burundai.radix50.net) by monty-python.gnu.org with esmtp (TLSv1:DES-CBC3-SHA:168) (Exim 4.34) id 1Bc5GL-0007Yi-DF for emacs-devel@gnu.org; Sun, 20 Jun 2004 12:33:45 -0400 Original-Received: from burundai.radix50.net (localhost [127.0.0.1]) by burundai.radix50.net (8.12.3/8.12.3/Debian -4) with ESMTP id i5KGZE3B003625 for ; Sun, 20 Jun 2004 18:35:14 +0200 Original-Received: (from ibr@localhost) by burundai.radix50.net (8.12.3/8.12.3/Debian -4) id i5KGZEef003624 for emacs-devel@gnu.org; Sun, 20 Jun 2004 18:35:14 +0200 Original-To: emacs-devel@gnu.org Mail-Followup-To: emacs-devel@gnu.org Content-Disposition: inline; filename=mutt4EvYRt In-Reply-To: <20040619150301.GC3467@ata.cs.hun.edu.tr> User-Agent: Mutt/1.5.6i X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.1.4 Precedence: list List-Id: "Emacs development discussions." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-devel-bounces+emacs-devel=quimby.gnus.org@gnu.org Xref: main.gmane.org gmane.emacs.devel:25122 X-Report-Spam: http://spam.gmane.org/gmane.emacs.devel:25122 Hello, On Sat, Jun 19, 2004 at 05:03:01PM +0200, Baurjan Ismagulov wrote: > I want to use Emacs to edit text in PT154 encoding. Here's what I've understood (please correct me if anything is wrong): * Emacs uses ISO 2022 internally. * Supported Cyrillic encodings map to ISO 8859-5. * PT154 (http://www.iana.org/assignments/charset-reg/PTCP154) cannot be mapped to ISO 8859-5 due to at least 2x18 characters not present in the latter. * One possible solution would be to define-charset cyrillic-asian. However, the number of characters in a new charset is limited to 94 or 96, and there are 108 non-us-ascii letters in PT154. So, the questions: * Can I define cyrillic-asian with 36 characters and use it together with us-ascii and cyrillic-iso8859-5, or should I define cyrillic-asian-1 and cyrillic-asian-2, duplicating 2x(33+3) chars already present in cyrillic-iso8859-5? * What to do with the final char? These charsets are not registered in ECMA. Thanks in advance, Baurjan.