From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: Stefan Monnier Newsgroups: gmane.emacs.devel Subject: Re: Displaying bytes (was: Inadequate documentation of silly Date: Sun, 29 Nov 2009 11:31:55 -0500 Message-ID: References: <20091119155848.GB1314@muc.de> <87aayiihe9.fsf@lola.goethe.zz> <20091119180848.GE1314@muc.de> <47325.130.55.118.19.1258658705.squirrel@webmail.lanl.gov> <20091119212550.GG1314@muc.de> <87my2ign8u.fsf@lola.goethe.zz> <912155b0911231334s2b52e8eq864251c9aed386b3@mail.gmail.com> NNTP-Posting-Host: lo.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii X-Trace: ger.gmane.org 1259512339 24133 80.91.229.12 (29 Nov 2009 16:32:19 GMT) X-Complaints-To: usenet@ger.gmane.org NNTP-Posting-Date: Sun, 29 Nov 2009 16:32:19 +0000 (UTC) Cc: per.starback@gmail.com, dak@gnu.org, emacs-devel@gnu.org, Kenichi Handa To: rms@gnu.org Original-X-From: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Sun Nov 29 17:32:11 2009 Return-path: Envelope-to: ged-emacs-devel@m.gmane.org Original-Received: from lists.gnu.org ([199.232.76.165]) by lo.gmane.org with esmtp (Exim 4.50) id 1NEmh1-0008HR-KF for ged-emacs-devel@m.gmane.org; Sun, 29 Nov 2009 17:32:11 +0100 Original-Received: from localhost ([127.0.0.1]:45382 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1NEmh0-0006D3-W3 for ged-emacs-devel@m.gmane.org; Sun, 29 Nov 2009 11:32:11 -0500 Original-Received: from mailman by lists.gnu.org with tmda-scanned (Exim 4.43) id 1NEmgv-0006CL-2N for emacs-devel@gnu.org; Sun, 29 Nov 2009 11:32:05 -0500 Original-Received: from exim by lists.gnu.org with spam-scanned (Exim 4.43) id 1NEmgq-0006B8-Al for emacs-devel@gnu.org; Sun, 29 Nov 2009 11:32:04 -0500 Original-Received: from [199.232.76.173] (port=36524 helo=monty-python.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1NEmgq-0006B4-6Y for emacs-devel@gnu.org; Sun, 29 Nov 2009 11:32:00 -0500 Original-Received: from ironport2-out.teksavvy.com ([206.248.154.183]:18691 helo=ironport2-out.pppoe.ca) by monty-python.gnu.org with esmtp (Exim 4.60) (envelope-from ) id 1NEmgm-0006p9-S4; Sun, 29 Nov 2009 11:31:56 -0500 X-IronPort-Anti-Spam-Filtered: true X-IronPort-Anti-Spam-Result: AroEAEYxEktFpYc//2dsb2JhbACBTdF+hDEEgyCGeQ X-IronPort-AV: E=Sophos;i="4.47,308,1257138000"; d="scan'208";a="50281960" Original-Received: from 69-165-135-63.dsl.teksavvy.com (HELO ceviche.home) ([69.165.135.63]) by ironport2-out.pppoe.ca with ESMTP; 29 Nov 2009 11:31:55 -0500 Original-Received: by ceviche.home (Postfix, from userid 20848) id 8082BB40E6; Sun, 29 Nov 2009 11:31:55 -0500 (EST) In-Reply-To: (Richard Stallman's message of "Sun, 29 Nov 2009 11:01:21 -0500") User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/23.1.50 (gnu/linux) X-detected-operating-system: by monty-python.gnu.org: Genre and OS details not recognized. X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: "Emacs development discussions." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Original-Sender: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Errors-To: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.devel:117927 Archived-At: > If it turns out that windows-1252 files are the main cause of > 8-bit-control characters in the buffer, here's another idea. It may be the case for some users, but it probably isn't the case in general. It's clearly not the case for me (I only/mostly see such characters in Gnus when I receive email that is improperly labelled, where I'm happy to see tham so that I complain to their originator). > Here's another idea. We could employ some heuristics to see if the > distribution of those characters seems typical for the way those > characters are used. For instance, some of the punctuation characters Using such heursitics might be a good idea in general to automatically detect which encoding is used, or which language is used. As time passes, it becomes less and less important for coding-systems in my experience (utf-8 and utf-16 seem to slowly take over and we already auto-detect them well). Stefan