From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: tomas@tuxteam.de Newsgroups: gmane.emacs.devel Subject: Re: Displaying bytes (was: Inadequate documentation of silly Date: Mon, 30 Nov 2009 07:05:36 +0100 Message-ID: <20091130060536.GB21880@tomas> References: <87my2ign8u.fsf@lola.goethe.zz> <912155b0911231334s2b52e8eq864251c9aed386b3@mail.gmail.com> <87d431f2uy.fsf@mail.jurta.org> NNTP-Posting-Host: lo.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=utf-8; x-action=pgp-signed Content-Transfer-Encoding: quoted-printable X-Trace: ger.gmane.org 1259561634 12043 80.91.229.12 (30 Nov 2009 06:13:54 GMT) X-Complaints-To: usenet@ger.gmane.org NNTP-Posting-Date: Mon, 30 Nov 2009 06:13:54 +0000 (UTC) Cc: dak@gnu.org, rms@gnu.org, Kenichi Handa , per.starback@gmail.com, emacs-devel@gnu.org, Stefan Monnier To: Juri Linkov Original-X-From: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Mon Nov 30 07:13:47 2009 Return-path: Envelope-to: ged-emacs-devel@m.gmane.org Original-Received: from lists.gnu.org ([199.232.76.165]) by lo.gmane.org with esmtp (Exim 4.50) id 1NEzW6-0007bP-KY for ged-emacs-devel@m.gmane.org; Mon, 30 Nov 2009 07:13:46 +0100 Original-Received: from localhost ([127.0.0.1]:33385 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1NEzW6-0005MQ-11 for ged-emacs-devel@m.gmane.org; Mon, 30 Nov 2009 01:13:46 -0500 Original-Received: from mailman by lists.gnu.org with tmda-scanned (Exim 4.43) id 1NEzUX-0004f8-E5 for emacs-devel@gnu.org; Mon, 30 Nov 2009 01:12:09 -0500 Original-Received: from exim by lists.gnu.org with spam-scanned (Exim 4.43) id 1NEzUS-0004b7-1g for emacs-devel@gnu.org; Mon, 30 Nov 2009 01:12:08 -0500 Original-Received: from [199.232.76.173] (port=58037 helo=monty-python.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1NEzUR-0004am-Ma for emacs-devel@gnu.org; Mon, 30 Nov 2009 01:12:03 -0500 Original-Received: from alextrapp1.equinoxe.de ([217.22.192.104]:60501 helo=www.elogos.de) by monty-python.gnu.org with esmtp (Exim 4.60) (envelope-from ) id 1NEzUM-0004gA-Mn; Mon, 30 Nov 2009 01:11:58 -0500 Original-Received: by www.elogos.de (Postfix, from userid 1000) id 444F09004B; Mon, 30 Nov 2009 07:05:36 +0100 (CET) Content-Disposition: inline In-Reply-To: <87d431f2uy.fsf@mail.jurta.org> User-Agent: Mutt/1.5.15+20070412 (2007-04-11) X-detected-operating-system: by monty-python.gnu.org: GNU/Linux 2.6 (newer, 2) X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: "Emacs development discussions." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Original-Sender: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Errors-To: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.devel:117952 Archived-At: -----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 On Mon, Nov 30, 2009 at 12:01:29AM +0200, Juri Linkov wrote: [...] > Unicad (http://www.emacswiki.org/emacs/Unicad) uses statistic models > to auto-detect windows-1252 and many many other coding systems > (auto-detecting windows-1252 is not advertised on the main page, > but actually can be observed in source code). The theory is described > at http://www.mozilla.org/projects/intl/UniversalCharsetDetection.html > I hope sometime this will be added to Emacs. It looks theoretically quite neat. I hope this too -- the current heuristics are often at a loss. Ironically, the cited page at mozilla doesn't display correctly in my browser (of all things mozilla!). Setting to auto-detect guesses UTF-8 whereas it's latin-1 -- as correctly advertised in the headers :-) (yes, it's off-topic and it's most-probably some miscofiguration on my side, but I thought some might savour the irony). But I also feel that we need more systematic heuristics. I'll give Unicad a try. Regards - -- tom=C3=A1s -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.6 (GNU/Linux) iD8DBQFLE2CwBcgs9XrR2kYRAsCxAJ0cyKl6hp5jN4+N7ogimn354z9+lgCdHAqW REqc68ZeDEqG7eXi7d/HFLU=3D =3DefXE -----END PGP SIGNATURE-----