From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: Stefan Monnier Newsgroups: gmane.emacs.devel Subject: Re: Language identification Date: Fri, 28 Aug 2009 00:58:42 -0400 Message-ID: References: <87skfczqc8.fsf@mail.jurta.org> NNTP-Posting-Host: lo.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii X-Trace: ger.gmane.org 1251435572 9339 80.91.229.12 (28 Aug 2009 04:59:32 GMT) X-Complaints-To: usenet@ger.gmane.org NNTP-Posting-Date: Fri, 28 Aug 2009 04:59:32 +0000 (UTC) Cc: joakim@verona.se, Emacs Development To: Juri Linkov Original-X-From: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Fri Aug 28 06:59:25 2009 Return-path: Envelope-to: ged-emacs-devel@m.gmane.org Original-Received: from lists.gnu.org ([199.232.76.165]) by lo.gmane.org with esmtp (Exim 4.50) id 1MgtYb-00062B-5g for ged-emacs-devel@m.gmane.org; Fri, 28 Aug 2009 06:59:25 +0200 Original-Received: from localhost ([127.0.0.1]:47325 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1MgtYa-0004kw-JS for ged-emacs-devel@m.gmane.org; Fri, 28 Aug 2009 00:59:24 -0400 Original-Received: from mailman by lists.gnu.org with tmda-scanned (Exim 4.43) id 1MgtY2-0004cS-Bw for emacs-devel@gnu.org; Fri, 28 Aug 2009 00:58:50 -0400 Original-Received: from exim by lists.gnu.org with spam-scanned (Exim 4.43) id 1MgtXx-0004bG-6w for emacs-devel@gnu.org; Fri, 28 Aug 2009 00:58:49 -0400 Original-Received: from [199.232.76.173] (port=60064 helo=monty-python.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1MgtXx-0004bA-2q for emacs-devel@gnu.org; Fri, 28 Aug 2009 00:58:45 -0400 Original-Received: from ironport2-out.pppoe.ca ([206.248.154.182]:51350 helo=ironport2-out.teksavvy.com) by monty-python.gnu.org with esmtp (Exim 4.60) (envelope-from ) id 1MgtXw-0006NY-JC for emacs-devel@gnu.org; Fri, 28 Aug 2009 00:58:44 -0400 X-IronPort-Anti-Spam-Filtered: true X-IronPort-Anti-Spam-Result: AgcFAHsAl0pFpYuS/2dsb2JhbACBU9Z5gjGBaAWHZA X-IronPort-AV: E=Sophos;i="4.44,289,1249272000"; d="scan'208";a="44380833" Original-Received: from 69-165-139-146.dsl.teksavvy.com (HELO ceviche.home) ([69.165.139.146]) by ironport2-out.teksavvy.com with ESMTP; 28 Aug 2009 00:57:46 -0400 Original-Received: by ceviche.home (Postfix, from userid 20848) id 12183B40F3; Fri, 28 Aug 2009 00:58:42 -0400 (EDT) In-Reply-To: <87skfczqc8.fsf@mail.jurta.org> (Juri Linkov's message of "Fri, 28 Aug 2009 03:27:35 +0300") User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/23.1.50 (gnu/linux) X-detected-operating-system: by monty-python.gnu.org: Genre and OS details not recognized. X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: "Emacs development discussions." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Original-Sender: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Errors-To: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.devel:114720 Archived-At: >> I often wish that files would open in Emacs with correct mode >> more often when there is no file extension. > In `auto-mode-alist' you can see that with the exception of > `archive-mode', `doc-view-mode' and `image-mode', all remaining > modes are programming text modes. It would be more useful > to identify file types for these modes that libmagic can't do. > Do you know a library that identifies programming languages? > Such a library might be implemented using a Bayesian classifier > trained on a sufficiently large corpus of different programming > languages. OTOH, how often do you see a file containg programming language code and yet without ny extension? Stefan